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Abstract. An alternative inference tool for using prior information to calculate marginal distribu- 
tion function in the Bayesian statistics is suggested. A few applications of this new tool are given. 



INTRODUCTION 

Total probability and Bayes formula are two basic tools for using prior information in 
the Bayesian statistics. In this paper we introduce an alternative tool for using prior 
information. This new toold enables us to improve some traditional results in statistical 
inference. However, as far as the authors know, there is no work on this subject, except 
[1]. The results of this paper can be extended to other branches of probability and 
statistics. 

In Section 2 total probability formula based on median is defined and its basic 
properties are proved. A few applications of this new tool are given in Section 3. All 
computations and plots are done using the S-PLUS 1 software system. 



TOTAL PROBABILITY FORMULA BASED ON MEDIAN 

Let X be a continuous random variable with distribution function F x \v(x\v), which 
depends on parameter v with known and continuous density function 7r(.). The marginal 
distribution function X can be calculated by total probability formula, i.e. 

poo 

F x (x)= / F xw (x\u) du (1) 



1 S-PLUS © 1988, 1999 MathSoft, Inc. 



Therefore F x is a weighted mean of F x \ v , i.e., F x is the expected value of F x \ v over 
7T. Our idea for the following definition is similar to (1). 

Definition 1 Let X have a distribution function depending on parameter v, where v 
has a density function 7r(.). The marginal distribution function of X based on median, 
F x (x), is defined as the median of F x \ v {x\v) over n. 

We recall that median is robust with respect to outlier data, but mean is not. To 
simplify calculations of F x (x), we use definition of median in statistics. That is we 
calculate F x (x) by solving the following equation 



f f xw (x\u)(F x (x)) = -, or equivalently P(F x \ v (x\v) < F x (x)) 



1 

2' 



(2) 



The following theorem states an important property of F x (x) . 



Theorem 1 F x (x) is a non-decreasing and continuous function of x. 



Proof: Let x\ < x 2 . For i = l,2, take 



ki = F x (xi) and Yi = F X \ v (xi\u). 



Then using 2 we have 



P(Yi < h) = P(Y 2 < k 2 ) 



1 



2 



We also have 



Y 1 <Y 2 . 




If F x (x) is a non-decreasing function, then 



F x (x_) = limFx-(t) and F x (x + ) = lim F x (t) 




exist and axe finite (e.g. [2]). 

Further, F x \ v (x\v) is continuous with respect to x, and so 



P(F X \ v (x-\v) < F X \ v {xJ)) = P(F xw (x\u) < F x (x-)) 



P(F X \ v (x+\v) < F x (x+)) = P(F x]u {x\u) < F x (x+)). 



And by (2) we have 



P(F X \ v (x\u) < F x {x-)) = P(F xw (x\u) < F x (x)) 

= P(F X \ v (x\v)<F x (x + )). (3) 

If Y — F x \v{x\v) has an increasing distribution function, then 

F x {x„) = F x {x) = F x {x + ) 

and by (3) F x (x) is continuous. 

On the other hand F x (x) is the median of Y, < Y < 1, and so 

< F x (x) < 1. 

Also, by Theorem 1, Fx(+oo) and F x (— oo) ex/*? as 

lim and lim F x (f) 

tt+oo tj— oo 

respectively. Therefore F x (x) is a distribution function if 

F x (+oo) = 1 and F x (-oo) = 0. 
Example 1 Le? X be exponentially distributed, i.e. 

F xw (x\v) = l-e-' /x , x>0 
and assume n(v) = 1, < v < 1. 

In this example we can calculate F x exactly by equation (2) as follows 

P(l-e-"*<F x (x)) = ± 

<=► P(i/ < ^ ^1 -&(*))) = ± 

<=► ^Hl-F x {x)) = \ 
^ F x (x) = l-e- x/2 , x>0. 

It can be shown that F x is a distribution function. Moreover, 

F x (x) = l + -(e- x -l),x>0. 

In some problems we cannot calculate F x exactly. But, we can approximate it in the 
two following cases. 



Algorithm Ml: When F x \ v (x\v) has an analytic form, but we cannot calculate F x 
analytically. 

1 . Fix x (say x ) 

2. Generate K sample for v by using tx(v) (say v k , k — l---,K) 

3. Calculate the sample median of 
F x (x ;vi),--- ,F x (x ;v K ) 

4. Repeat from step 1 with another choice of x. 

Algorithm M2: When F x \ v (x\v) has not an analytic form. 

1. Fix x (say xq) 

2. Generate K sample for v by using (say u k , k = 1 • • • , K) 

3. Generate L sample for X\v k for each k(= 1, ■ • • , K) 
(say iX L \vi)---(xi\v K ,--- ,x L \u K )) 

4. Calculate the empirical distribution function of X\v k for each k(= 1, • • • , K) (based 
on generated samples in the previous step) 

5. Calculate the sample median of empirical distribution functions in step 4 and repeat 
from step 1 with another choice of x. 

Remark 1 We can approximate F x by algorithms similar to Ml and M2 (are called Bl 
and B2 corresponding to Ml and M2). 

Figure 1, shows the graphs of F x , F x , and their approximations for Example 1. 
Example 2 Let X be exponentially distributed, (similar Example 1) i.e. 



F x \ u (x\u) = 1 -e 



;/:/■ 



, x > 



but here 



7r(^) = e 



— v 



v > 0. 



In this case 



F x (x) = l-e 



,zln(l/2) 



, X > 



is a distribution function and 



F x (x) = 1 



1 



x + 1 



, x > 0. 



Figure 2 shows their graphs. 
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FIGURE 1. Graphs of F X ,F X , and their approximations in Example 1 for K = 100, 1000 



100 iterations 




FIGURE 2. Graphs of Fx, Fx, and their approximations in Example 2 for K = 100, 1000 



HYPOTHESIS TESTING 



In this section we introduce a few applications of F x to improve traditional results in 
statistical inference. 

In the previous section we showed that Fx is a distribution function under a few 
conditions. If Fx depends on some unknown parameters we can apply classical methods 
in statistics to make inference about the unknown parameters. For example, uniformly 
most powerful (UMP) test can be calculated by Karlin-Robin theorem [], or the most 
powerful (MP) test can be calculated by the following version of Neyman-Pearson 
lemma's. 

Lemma 1.1 Consider testing 

H -.e = e 

H 1 :e = e 1 ' {) 

where 6 is an unknown parameter of Fx and 6q, 6\ are fixed known numbers. If Fx does 
not depend on any other unknown parameters under H and Hi, then 

, — \o=ei> \e=e Q 

^ = < n , < , d F x{x) , > < 5 > 

u l«=»i< k ^bT~ \e=e Q 

for some k>0, is the MP test of its size for testing. 

Proof: Let Fx (x) = • Then Fx (x) is a continuous density function which does not 
depend on any other unknown parameters under H and Hi. Therefore by the Neyman- 
Pearson lemma (1.1) is the MP test of its size for testing (5). 

Example 3 Consider testing 

H : ii = 

Hi : n < ' ( ' 

based on an observation from a normal distribution X ~ N(fj,, a 2 ). 

If a 2 is known, then the family of normal distribution has Monotone Likelihood Ratio 
(MLR) property and according to the Karlin-Robin theorem 

m*)={j !:::• m 

is the Uniformly Most Powerful (UMP), of size a test function for 6, where P(Z < 
z a ) = a and Z ~ N(0, 1). But, if a 2 is unknown, then the best test does not exist. 

In the Bayesian approach, where a 2 (variance) has a prior density function such as 
uniform or exponential (where defined in Examples 1 and 2 respectively), we can find 
marginal distribution functions F x and F x which depends on p,. Figure 3 shows the 
graphs of power functions of the tests based on Fx and Fx for a = 0.05. We also plot 
the graph of power function of 7 for a = 0.4, 1 (i.e. when a is known). The graphs show 



that the test based on F x is better than the test based on F x , when we use exponential 
prior for a 2 . 

Moreover, we plot the graphs of power functions of the tests based on F x and F x in 
the two cases of uniform and exponential prior distributions for a (standard deviation) 
in Figure 3. The result is incredible! The test based on F x is much better than the test 
based on F x . 



PARAMETER ESTIMATION 

Consider the case where the distribution of X depends on two parameters 9 and v, i.e.,, 
we have F x ^ e (x\v,9). Then, we can define F x \ e (x\9) and F x \e{x\0) as in previous case. 

Then, by derivating them with respect to x, we can also define f x \g(x\6) and f x \e{x\9). 
Assume now that we have a data set x — 1, • • • ,x N where we assume its distribution to 
be F x \ v ,e(x\u,9) and where we have prior knowledge ir(v), and we want to estimate 9 
for this data set. 

The classical MLE is defined by 

= axgmax|L(e)=JI/x| 9 (a; i |0)J (8) 
Similarly, based on our new criterion we propose the following 

9 = axgmax = n/x|*N#) j • (9) 

To show the relative performances of these two estimators, we 
TO COMPLETE LATER 



CONCLUSION 

We introduced an alternative inference tool for using prior information by defining 
a marginal function F x \ v (x\v) which is based on median in place of F x \ v (x\v) which 

is the expected value of F x \ v (x\v) with respect to We proved that F x \ v (x\v) is a 
non-decreasing and continuous function of x and presented some of its applications and 
its performances in hypothesis testing and in parameter estimation. 
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Power functions by using Uniform prior for sigma A 2 




FIGURE 3. The graphs of power functions when the variance has a uniform and exponential prior. 



Power functions by using Uniform prior for sigma 




FIGURE 4. The graphs of power functions, when the standard deviation has a uniform and exponential 
prior. 



